To appear: SDAIR 1995. Generating Synthetic

Authors

  • David Doermann
  • Shee Yao
Abstract

In this paper we describe work on a system for modeling errors in the output of OCR systems. The project is motivated by the desire to evaluate the performance of various text analysis systems under varying yet controlled conditions. We describe a set of symbol and page models that are used to degrade an ideal text by introducing errors that typically occur during scanning, decomposition, and recognition of document images. A first generation of the software is described that implements the page models and allows the use of transition probabilities, either extracted from real data or generated synthetically, to corrupt text.
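As a rough illustration of corrupting text with transition probabilities, the sketch below samples each output symbol from a per-character distribution. It is not the authors' software: the table `CONFUSIONS`, its probabilities, and the functions `corrupt` and `estimate_table` are hypothetical names and values chosen for this example, and it covers only symbol-level substitutions, deletions, and splits, not the page models.

```python
import random

# Hypothetical transition table: P(observed output | true character).
# An empty string models a deletion; a two-character string models a split.
# The probabilities are made up for illustration, not measured from real OCR data.
CONFUSIONS = {
    "l": {"l": 0.90, "1": 0.06, "I": 0.04},
    "m": {"m": 0.92, "rn": 0.08},
    "e": {"e": 0.95, "c": 0.04, "": 0.01},
}


def corrupt(text, table, rng):
    """Degrade ideal text by sampling each character's output from the table."""
    out = []
    for ch in text:
        dist = table.get(ch)
        if dist is None:
            # Characters without an entry pass through unchanged.
            out.append(ch)
            continue
        outputs = list(dist)
        weights = [dist[o] for o in outputs]
        out.append(rng.choices(outputs, weights=weights, k=1)[0])
    return "".join(out)


def estimate_table(pairs):
    """Estimate transition probabilities from aligned (true, observed) pairs."""
    counts = {}
    for true_ch, seen in pairs:
        row = counts.setdefault(true_ch, {})
        row[seen] = row.get(seen, 0) + 1
    return {
        t: {o: n / sum(row.values()) for o, n in row.items()}
        for t, row in counts.items()
    }


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so a run is repeatable
    print(corrupt("a small sample of ideal text", CONFUSIONS, rng))
```

Passing a seeded `random.Random` keeps the degradation controlled and repeatable, which matches the stated goal of evaluating text analysis systems under varying yet controlled conditions. A table built by `estimate_table` from aligned ground truth and OCR output could stand in for the hand-written one.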


Similar resources

Generating the synthetic CT (sCT) and synthetic MR (sMR: sT1w/sT2w) images of the brain using atlas based method

Introduction: Radiation therapy planning (RTP) is one of the clinical applications in which both CT scan and MRI are used. MR and CT images are applied to determine the target volume and calculation of dose distribution, respectively. In addition, using two imaging modalities increases the department workload and cost. In this study, an algorithm was presented to create synthet...


Generating Synthetic Computed Tomography and Synthetic Magnetic Resonance (sMR: sT1w/sT2w) Images of the Brain Using Atlas-Based Method

Introduction: Nowadays, magnetic resonance imaging (MRI) in combination with computed-tomography (CT) is increasingly being used in radiation therapy planning. MR and CT images are applied to determine the target volume and calculate dose distribution, respectively. Since the use of these two imaging modalities causes registration uncertainty and increases department w...


Generating Representative Synthetic Workloads An Unsolved Problem

Synthetic disk request traces are convenient and popular workloads for performance evaluation of storage subsystem designs and implementations. This paper develops an approach for validating synthetic disk request generators. Using this approach, commonly-used simplifying assumptions about workload characteristics (e.g., uniformly-distributed starting addresses and Poisson arrivals) are shown t...


A Hybrid Learning Model of Abductive Reasoning

Multicausal abductive tasks appear to have deliberate and implicit components: people generate and modify explanations using a series of recognizable steps, but these steps appear to be guided by an implicit hypothesis evaluation process. This paper proposes a hybrid symbolic-connectionist learning architecture for multicausal abduction. The architecture tightly integrates a symbolic Soar model...


Real-time Integration of Synthetic Computer Graphics into Live Video Scenes

In commercials and motion pictures, computer graphics is often used to achieve special effects, e.g., adding synthetic dinosaurs in “Jurassic Park”. This process usually implies special camera equipment and a careful and time consuming post processing of single frames. We consider a simplified scenario, where synthetic objects are added automatically to a live scene in real-time. Reference poin...




Publication date: 1995